Abstract

Background: Mutations in PEX1 are the most common primary cause of Zellweger syndrome. In addition to
exonic mutations, deletions and splice site mutations, two 5' polymorphisms at c.-137 and c.-53 with a potential
influence on PEX1 protein levels have been described in the 5' untranslated region (UTR) of the PEX1 gene.

Methods: We used RACE and in silico promoter prediction analysis to study the 5' UTR of PEX1. We determined 
the distribution of PEX1 5' polymorphisms in a cohort of 30 Zellweger syndrome patients by standard DNA
sequencing. 5' polymorphisms were analysed in relation to the two most common mutations in PEX1 and were 
incorporated into a novel genotype-phenotype analysis by correlation of three classes of PEX1 mutations with 
patient survival. 

Results: We provide evidence that the polymorphism 137 bp upstream of the ATG codon is not part of the UTR, 
rendering it a promoter polymorphism. We show that the first, but not the second most common PEX1 mutation 
arose independently of a specific upstream polymorphic constellation. By genotype-phenotype analysis we 
identified patients with identical exonic mutation and identical 5' polymorphisms, but strongly differing survival. 

Conclusions: Our study suggests that two different types of PEX1 5' polymorphisms have to be distinguished: a 5'
UTR polymorphism at position c.-53 and a promoter polymorphism 137 bp upstream of the PEX1 start codon. Our
results indicate that the exonic PEX1 mutation correlates with patient survival, but the two 5' polymorphisms 
analysed in this study do not have to be considered for diagnostic and/or prognostic purposes. 



Background 

Peroxisome biogenesis disorders (PBD, MIM #601539) are a group of autosomal recessive
diseases characterized by a severe developmental and metabolic phenotype [1]. PBD comprise
the Zellweger syndrome (ZS) spectrum and rhizomelic chondrodysplasia punctata. The ZS
spectrum consists of three overlapping clinical phenotypes representing a continuum of
disorders, ranging from infantile Refsum disease (IRD, MIM #266510) as the mildest variant,
to neonatal adrenoleukodystrophy (NALD, MIM #202370) and to ZS (MIM #214100) as the
prototype and most severe form with a life expectancy of less than one year. ZS patients
often show characteristic dysmorphic features and severe neurodevelopmental brain
abnormalities like polymicrogyria and pachygyria. Clinically, patients present with
muscular hypotonia and poor feeding, seizures, eye abnormalities, liver dysfunction and
skeletal defects.

ZS is caused by mutations in PEX genes, encoding proteins (peroxins) required for the
biogenesis of peroxisomes [1,2]. Peroxisomes are ubiquitous cellular organelles with
functions in ether lipid biosynthesis, α-oxidation of phytanic acid, β-oxidation of very
long chain fatty acids (VLCFAs) and of bile acid precursors and docosahexaenoic acid (DHA)
precursors, and in hydrogen peroxide metabolism [3]. Peroxisomes are versatile organelles
which can be induced and regulated according to metabolic needs [4]. In humans, 16
evolutionarily conserved peroxins are known, mutations in 13 of which are associated with
disease. The peroxins PEX5 and PEX7 are cytosol-based import receptors for peroxisomal
matrix proteins. The import receptor PEX5 loaded with matrix proteins can associate with
most peroxisomal proteins in the cytosol and is involved in the translocation of these
proteins into the peroxisome. According to one model, a downstream complex of two ATPases,
PEX1 and PEX6, and the peroxisomal membrane protein PEX26 is thought to mediate recycling
of PEX5 into the cytosol [5,6]. This process depends on ubiquitination of PEX5 during the
translocation process at the peroxisomal membrane. Models of peroxisome protein import
have been reviewed recently [7,8].

PEX1 and the paralogous PEX6 gene encode members of the AAA (ATPases associated with
diverse cellular activities) family of proteins. PEX1 and PEX6 each comprise two AAA
domains. The PEX1 gene spans about 41.5 kbp and is composed of 24 exons, encoding an mRNA
of 4.4 kbp. Mutations in PEX1 are the most common primary cause of Zellweger syndrome and
account for roughly two thirds of all PBD patients [9-14]. The most commonly found
mutation in PEX1 is c.2528G>A, leading to an exchange of Gly to Asp in the PEX1 protein
[15-17]. The second most frequent mutation is the insertion c.2097_2098insT, leading to a
frameshift after Ile700 and a premature termination codon [9,18]. These common PEX1
mutations have been included in genotype-phenotype analyses showing that the missense
mutation is associated with the mild end of the clinical spectrum and the insertion with
more severely affected patients [1]. This correlation holds true in the majority of cases,
but exceptions can be observed in which the same genetic constellation leads to largely
divergent phenotypes. Such cases make it necessary to look for potential effects that
modulate patient outcome on the basis of a given primary defect. More recently, two
polymorphisms have been described upstream of the PEX1 coding region. One is a C-to-G
transversion located 53 nucleotides upstream of the PEX1 ATG (c.-53C>G). The other is a
T-to-C transition, c.-137T>C. The c.-137T>C polymorphism resides in a potential in-frame
ATG codon that could expand the PEX1 ORF by 138 bp. This ATG, however, could be shown not
to represent a translational start codon [19]. Reporter studies by Maxwell et al. suggest
that the -137C polymorphism results in reduced expression of PEX1 whereas the -53G
polymorphism increases expression of PEX1 [19]. A combination of both polymorphisms
results in near-normal expression of PEX1 in a reporter experiment [19]. Polymorphisms
modulating the PEX1 protein level could possibly modulate the disease phenotype of PEX1
patients. An analysis of polymorphisms and PEX1 mutations in 25 PEX1-deficient patients
suggested that the c.-137T>C transition cosegregates with the c.-53C>G transversion. In
addition, both polymorphisms were found to cosegregate with the c.2097_2098insT mutation
[19].

To evaluate the potential influence of known 5' polymorphisms in PEX1 on the disease
phenotype we analysed the 5' UTR in more detail. We provide evidence that the -137C
polymorphism is not a 5' UTR but rather a promoter polymorphism. We included both
polymorphisms in a genotype-phenotype analysis of 30 PBD patients with PEX1 mutations
representing the whole spectrum of clinical phenotypes.

Methods 

Collection of patients 

5' UTR polymorphisms of 30 patients with mutations in the PEX1 gene were analyzed. The
patients had been referred to the Department of Pediatrics and Pediatric Neurology at the
University of Göttingen for diagnostic evaluation of peroxisome biogenesis disorder and
through the German LEUKONET. Informed consent was obtained from the guardians of all
patients. The work presented in this study was conducted under the approval of the local
ethics committee. The clinical phenotype and plasma biochemical abnormalities pointed
towards a peroxisome biogenesis disorder, and PEX1 gene mutations were identified by
direct sequencing. The PEX1 mutations of this patient cohort have been published in part
previously [12]. In some instances PEX1 deficiency was further confirmed by
complementation analysis [14].

Extraction of genomic DNA from fibroblasts and blood 

Genomic DNA was extracted from 10^7 cells. After trypsinization, cells were washed in PBS,
resuspended in 800 µl lysis buffer (50 mM Tris-HCl, pH 8.0, 100 mM EDTA, 0.5% SDS) and
treated with 50 µg proteinase K for 16 h at 55°C. DNA was extracted by a standard protocol
using phenol-chloroform. Genomic DNA from blood was extracted using the DNeasy
Blood & Tissue Kit (Qiagen, Hilden, Germany) following the instructions of the
manufacturer.

RNA extraction and cDNA-Synthesis (Reverse 
Transcription) 

RNA was extracted from fibroblasts using peqGOLD TriFast (Peqlab, Erlangen, Germany). For
first strand synthesis we used the SuperScript III First-Strand Synthesis System
(Invitrogen, Darmstadt, Germany) with 1 µg total RNA together with Oligo(dT)20 primers
according to the specifications of the manufacturer.

Determination of polymorphisms and haplotype analysis 

Single nucleotide polymorphisms at c.-53C>G and c.-137T>C in the PEX1 gene, and exonic
mutations, were analysed by sequencing after PCR amplification using the primers listed in
the table in additional file 1. PCR reactions were performed in a Thermocycler T3000
(Biometra, Göttingen, Germany) and analysed on 1% agarose gels. All PCR products were
purified using the High Pure PCR Purification Kit (Roche, Grenzach-Wyhlen, Germany)
following the instructions of the manufacturer. For DNA sequencing the BigDye Terminator
v3.1 Cycle Sequencing kit (Applied Biosystems, Darmstadt, Germany) was used. Sequencing
reactions were purified by DNA precipitation, dissolved in 10 µl Hi-Di formamide (Applied
Biosystems) and analysed on a 3100-Avant Genetic Analyzer (Applied Biosystems). ABI Prism
3100 Genetic Analyzer Data Collection Software 2.0 (Applied Biosystems) and Lasergene
Sequencing Analysis Software 5.1 (Dnastar, Madison, WI, USA) were used for sequence
analysis.

Rapid amplification of cDNA ends (RACE) 

The 5'/3' RACE Kit (Roche) was used to characterize the transcriptional start of the PEX1
mRNA following the manufacturer's protocol. Briefly, PEX1-specific reverse primers (SP1,
SP2, SP3, respectively) and reverse transcriptase were used to synthesize cDNA extending
to the 5' end of the PEX1 mRNA. The 3' end of the resulting cDNA was polyadenylated using
terminal transferase. The resulting cDNA was PCR amplified using internal PEX1-specific
primers (SP2, Pex1ex2, SP1, respectively) and an oligo-dT anchor primer. This step was
followed by a nested PCR (primers Pec10 or SP2, respectively, and the anchor primer). PCR
products were directly cloned into pGEM-T-Easy Vectors I (Promega, Mannheim, Germany) and
sequenced using the T7 and SP6 sites of the vector. For primer sequences see the table in
additional file 1.

Results and Discussion 

Definition of the PEX1 5'UTR 

Both 5' polymorphisms, c.-137T>C and c.-53C>G, have been described as 5' UTR
polymorphisms, implying that they become part of the mRNA. The PEX1 5' UTR was reported to
extend up to c.-245 according to PCR amplification studies [19]. In PCRs with cDNA
prepared from cultivated primary human fibroblasts, however, we could not amplify cDNA
with primers upstream of or including c.-137 (primers 455H, 455K, 455I, and OST542 in
Figure 1). Our analysis also included the primer used in a previous study [19], here
designated OST542 (Figure 1). In contrast, all primers downstream of c.-137 (primers 455C,
455, 455G, 455E, 455F) could amplify the PEX1 cDNA. The most upstream functional primer
extended up to c.-123, suggesting that the polymorphism at c.-137 is not necessarily part
of the mRNA. Identification of the PEX1 transcriptional start is of importance because a
transcriptional start at around c.-123 would render the upstream polymorphism at c.-137 a
promoter polymorphism. For comparison, a database entry marked c.-96 as the PEX1 mRNA
start (NCBI Reference Sequence NM_000466.2). To experimentally confirm the transcriptional
start point of PEX1, we conducted a 5' RACE (rapid amplification of cDNA ends) experiment.
The 5' region of PEX1 mRNA was reverse transcribed using a gene-specific internal primer,
PCR amplified in two successive rounds and cloned into a pGEM-T vector for sequencing. The
longest 5' extension of the cDNA that could be identified by this approach extended to
c.-120, giving no further indication that the 5' UTR of PEX1 stretches as far as c.-245.
To gain further insight into the PEX1 5' UTR/promoter region, we searched the 500 bp
upstream of the ATG for putative transcription factor binding sites using the weight
matrix-based Match tool within the TRANSFAC database [20,21]. Results are summarized in
the table in additional file 2. This analysis shows that the region upstream of c.-100 is
covered by GC box elements that are bound by one of the classical upstream transcription
factors, Sp1 [22], also arguing against a transcriptional start at c.-245.

In summary, PCR and RACE experiments, PEX1 cDNA database entries and a promoter analysis
indicate that the PEX1 polymorphism at c.-137 is not necessarily a polymorphic site in the
5' UTR but could also be a polymorphism within the promoter. It is of interest that the
poorly characterized ORF C7orf64 starts 242 bp upstream of the PEX1 start codon and
extends on the other DNA strand. Thus the PEX1 5' region previously defined as UTR [19]
coincides with the intergenic region between PEX1 and C7orf64 and would comprise UTRs of
both genes.
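
The positional argument can be sketched in a few lines of Python. This is only an illustration under stated assumptions: positions are written as negative integers relative to the A of the ATG (c.-N), the transcript start is taken as the longest 5' extension seen by RACE (c.-120), and the function name and labels are hypothetical, not part of the study.

```python
# Upstream positions are given as negative integers relative to the ATG (c.-N).
# Assumption: the transcript start is the longest 5' RACE extension (c.-120).
RACE_5PRIME_END = -120


def classify_upstream_site(position, transcript_start=RACE_5PRIME_END):
    """Label an upstream position as transcribed (5' UTR) or not (hypothetical helper)."""
    if position >= 0:
        raise ValueError("expected a negative c.-N position upstream of the ATG")
    if position >= transcript_start:
        return "5' UTR"
    return "upstream of transcript"


SITES = {"c.-137T>C": -137, "c.-53C>G": -53}
classification = {name: classify_upstream_site(pos) for name, pos in SITES.items()}

# Same question with the most upstream functional PCR primer (c.-123) as the bound.
classification_pcr = {
    name: classify_upstream_site(pos, transcript_start=-123)
    for name, pos in SITES.items()
}

for name, label in classification.items():
    print(f"{name}: {label}")
```

With either bound (c.-120 from RACE or c.-123 from PCR) the two sites fall on opposite sides of the inferred transcript start, which is the basis for treating c.-137 as a candidate promoter polymorphism.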

5' Polymorphisms in a PEX1 patient population 

PEX1 upstream polymorphisms bear the potential to modulate the disease phenotype due to
their influence on PEX1 expression levels. We assessed the presence or absence of the two
known 5' polymorphisms at c.-137 and c.-53 in a well-characterized PBD patient collection
with PEX1 mutations [12]. PEX1 mutations in this patient collection are deletions (11 out
of 60 alleles), insertions (12/60), and point mutations (34/60). In three patients, only
one affected allele has been identified so far (Table 1). The two most common mutations
are the missense mutation c.2528G>A in exon 15 (22 out of 60 alleles), resulting in an
amino acid exchange of glycine to aspartate (p.Gly843Asp or G843D), and the insertion
c.2097_2098insT in exon 13 (10/60 alleles), leading to a frameshift after isoleucine in
position 700 and a premature termination codon (PTC). Altogether, 20 of the 60 alleles
include PTCs, leading to potentially more severe phenotypes. Four constellations of the 5'
polymorphisms were identified in the cohort of 30 patients. We found the common
constellation (c.-137 TT and c.-53 CC) in 13 (43%) of the patients and 11 (37%) patients
heterozygous at both sites (c.-137 TC and c.-53 GC). The 5' polymorphism at c.-137 alone
in combination with a wild-type allele at c.-53 (c.-137 TC and c.-53 CC) appeared in three
(10%) of the PEX1 patients. Another three (10%) were homozygous for both polymorphisms
(c.-137 CC and c.-53 GG). The three patients with a c.-137T>C transition without the
c.-53C>G transversion
Figure 1 Analysis of the 5' region of PEX1. Schematic representation of the PEX1 5' region. Positions c.-137, c.-53 and the start codon are
marked. Primers that in combination with the reverse primer Pec10 or PEX1-Ex2 (exon 2) give PCR products are marked in light gray (cDNA
template). Primers that do not yield PCR products are marked in dark gray. In our analysis only primers downstream of c.-137 could be used to
amplify sequences from PEX1 cDNA. These results were confirmed by RACE.



Table 1 Distribution of 5' polymorphisms in a cohort of PEX1 peroxisome biogenesis disorder patients

| Patient | 1st mutation | Amino acids | Exon | hom/het | 2nd mutation | Amino acids | Exon | hom/het | Survival (months) | c.-137 T>C | c.-53 C>G |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ZS1 | c.2226+2T>C | r.2072_2416del p.Ala691_Lys806[illegible] | intron 13 | hom | | | | | 2 | TT | CC |
| ZS2 | c.274G>C | p.Val92Leu | 3 | hom | | | | | 23 | TT | CC |
| ZS3 | [illegible] | [illegible] | 13 | hom | | | | | [illegible] | TT | CC |
| ZS4 | [illegible] | [illegible] | [illegible] | hom | | | | | 3 | TT | CC |
| ZS5 | c.2383C>T | [illegible] | 14 | het | [illegible] | [illegible] | intron 15 | het | [illegible] | TT | CC |
| ZS6 | c.3038G>A | p.Arg1013His | 20 | het | c.3287C>G | p.Ser1096X | 21 | het | 5 | TT | CC |
| ZS7 | c.2528G>A | p.Gly843Asp | 15 | het | c.274-1G>C | r.274_357del p.Val92_Leu119del | intron 2 | het | >170 | TT | CC |
| ZS8 | c.2528G>A | p.Gly843Asp | 15 | het | 2nd mutation? | | | | 70 | TT | CC |
| ZS9 | c.2528G>A | p.Gly843Asp | 15 | hom | | | | | >187 | TT | CC |
| ZS10 | c.2528G>A | p.Gly843Asp | 15 | hom | | | | | >214 | TT | CC |
| ZS11 | c.2528G>A | p.Gly843Asp | 15 | hom | | | | | | TT | CC |
| ZS12 | c.2528G>A | p.Gly843Asp | 15 | hom | | | | | >240 | TT | CC |
| ZS13 | c.2528G>A | p.Gly843Asp | 15 | hom | | | | | 2 | TT | CC |
| ZS14 | c.724G>A | p.Val242Ile | 5 | het | 2nd mutation? | | | | 3 | TC | GC |
| ZS15 | c.94_96del | p.Pro32del | 1 | het | c.2981T>C | p.Leu994Pro | 19 | het | | TC | GC |
| ZS16 | c.911_912del | p.Ser304fsX2 | 5 | het | c.2387T>C | p.Leu796Pro | 14 | het | 33 | TC | GC |
| ZS17 | c.2528G>A | p.Gly843Asp | 15 | hom | | | | | 5 | TC | GC |
| ZS18 | c.2528G>A | p.Gly843Asp | 15 | hom | | | | | | TC | GC |
| ZS19 | c.2097_2098insT | p.Ile700TyrfsX42 | 13 | het | c.1952_1960dup | p.W653_M654insTVW | 12 | het | 10 | TC | GC |
| ZS20 | c.2097_2098insT | p.Ile700TyrfsX42 | 13 | het | c.2528G>A | p.Gly843Asp | 15 | het | | TC | GC |
| ZS21 | c.2097_2098insT | p.Ile700TyrfsX42 | 13 | het | c.2528G>A | p.Gly843Asp | 15 | het | >214 | TC | GC |
| ZS22 | c.2097_2098insT | p.Ile700TyrfsX42 | 13 | het | c.3037C>G | p.Arg1013Gly | 20 | het | 3 | TC | GC |
| ZS23 | c.2097_2098insT | p.Ile700TyrfsX42 | 13 | het | c.3124A>C | F 1042V | | het | >120 | TC | GC |
| ZS24 | c.2097_2098insT | p.Ile700TyrfsX42 | 13 | het | c.2916delA | p.Gly973AlafsX16 | 18 | het | 3 | TC | GC |
| ZS25 | c.2528G>A | p.Gly843Asp | 15 | het | c.1439delT | p.Leu480TrpfsX2 | 7 | het | 18 | TC | CC |
| ZS26 | c.2528G>A | p.Gly843Asp | 15 | het | 2nd mutation? | | | | | TC | CC |
| ZS27 | c.2528G>A | p.Gly843Asp | 15 | het | c.2614C>T | p.Arg872X | 16 | het | | TC | CC |
| ZS28 | c.2097_2098insT | p.Ile700TyrfsX42 | 13 | hom | | | | | 11 | CC | GG |
| ZS29 | c.2097_2098insT | p.Ile700TyrfsX42 | 13 | hom | | | | | 5 | CC | GG |
| ZS30 | c.2528G>A | p.Gly843Asp | 15 | het | c.249insT | p.Leu84SerfsX24 | 2 | het | 3 | CC | GG |

The table displays the 5' polymorphisms and the previously identified exonic mutations in 30 PEX1 patients. Mutations c.2528G>A and c.2097_2098insT are
especially common. Homozygosity (hom) is inferred from the absence of heterozygosity in the sequencing reaction and the absence of a second site mutation
in PEX1. In three patients, ZS8, ZS14, and ZS26, the second mutation is not identified. The 5' polymorphism genotypes were determined by direct sequencing.
Four possible constellations of the 5' polymorphisms were identified in the patient cohort. Patients' survival is indicated where this information was available.
A ">" before the number of months indicates patients who were alive and had reached the respective age at the last follow-up in 2010.
indicated that the c.-137T>C transition can be present in the absence of the c.-53C>G
transversion. This constellation was not present in the patient cohort analysed by Maxwell
and colleagues [19]. On the background of this polymorphism constellation, PEX1 expression
could potentially be reduced to a stronger extent than caused by the patients' PEX1
mutations alone. The polymorphic constellations identified in this study (TT CC, TC GC,
CC GG, and TC CC) are four of a total of nine possible constellations. These four
constellations can be explained by the existence of only three alleles: -137C -53G,
-137T -53C, and the rare -137C -53C. This would be the most parsimonious, but still
speculative, interpretation.
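
The parsimony argument can be checked by enumeration. The sketch below is an illustration only: it assumes the three alleles named in the text, pairs them in all unordered combinations, and compares the resulting genotype constellations with the four constellations genotyped in Table 1 (TT CC, TC GC, CC GG, TC CC). Function and variable names are hypothetical.

```python
from itertools import combinations_with_replacement, product

# The three alleles proposed in the text, as (c.-137 base, c.-53 base).
HAPLOTYPES = [("C", "G"), ("T", "C"), ("C", "C")]


def genotype(base1, base2, order):
    """Unordered genotype at one site, written in the paper's order (e.g. 'TC', 'GC')."""
    return "".join(sorted((base1, base2), key=order.index))


def constellation(hap1, hap2):
    """Genotype constellation (c.-137, c.-53) produced by a pair of alleles."""
    return (genotype(hap1[0], hap2[0], "TC"), genotype(hap1[1], hap2[1], "GC"))


# All nine genotype combinations that two biallelic sites can form.
all_nine = set(product(["TT", "TC", "CC"], ["CC", "GC", "GG"]))

# Constellations reachable from the three proposed alleles.
reachable = {
    constellation(h1, h2)
    for h1, h2 in combinations_with_replacement(HAPLOTYPES, 2)
}

# Constellations genotyped in the cohort (Table 1).
observed = {("TT", "CC"), ("TC", "GC"), ("CC", "GG"), ("TC", "CC")}

print(len(all_nine), "possible;", len(reachable), "reachable;", len(observed), "observed")
print("observed explained by three alleles:", observed <= reachable)
print("reachable but not observed:", sorted(reachable - observed))
```

Under these assumptions the three alleles can produce six constellations, and all four observed ones are among them; the two unobserved ones both require the rare -137C -53C allele in combination with itself or with -137C -53G.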

In sum, the allele frequency of the c.-137T>C polymorphism in our patient cohort was 33%
(20/60), compared to 14.9% in the European population (NCBI single nucleotide polymorphism
(SNP) database). Interestingly, the allele frequency of the c.-53C>G polymorphism was 28%
(17/60) in our cohort, whereas in the healthy population it is reported to range between
4.4 and 7.8% (NCBI SNP database).
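
The cohort allele frequencies follow directly from the genotype counts reported above (13 TT CC, 11 TC GC, 3 TC CC and 3 CC GG patients). A minimal sketch of that arithmetic, with hypothetical names, is:

```python
# Genotype counts per site, derived from the four constellations in the cohort:
# 13 x (TT, CC), 11 x (TC, GC), 3 x (TC, CC), 3 x (CC, GG).
GENOTYPE_COUNTS = {
    "c.-137T>C": {"TT": 13, "TC": 11 + 3, "CC": 3},
    "c.-53C>G": {"CC": 13 + 3, "GC": 11, "GG": 3},
}
VARIANT_ALLELE = {"c.-137T>C": "C", "c.-53C>G": "G"}


def variant_allele_count(counts, allele):
    """Number of chromosomes carrying the variant allele."""
    return sum(gt.count(allele) * n for gt, n in counts.items())


results = {}
for site, counts in GENOTYPE_COUNTS.items():
    n_alleles = 2 * sum(counts.values())  # two alleles per patient
    n_variant = variant_allele_count(counts, VARIANT_ALLELE[site])
    results[site] = (n_variant, n_alleles, n_variant / n_alleles)
    print(f"{site}: {n_variant}/{n_alleles} = {n_variant / n_alleles:.1%}")
```

Each heterozygous patient contributes one variant allele and each homozygous patient two, which gives 20/60 for c.-137C and 17/60 for c.-53G.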

We next analysed the distribution of PEX1 5' polymorphisms in relation to the two most
common PEX1 mutations. Table 2 shows the distribution of 5' polymorphisms in relation to
c.2097_2098insT. Of eleven patients heterozygous at both sites (c.-137 TC and c.-53 GC),
six were heterozygous for the insertion c.2097_2098insT. The remaining five patients did
not carry the insertion. The three patients with the heterozygous polymorphism only at
position c.-137 (c.-137 TC and c.-53 CC) did not carry the c.2097_2098insT insertion, and
both patients homozygous for c.2097_2098insT were homozygous for the two polymorphisms
(c.-137 CC and c.-53 GG). None of our patients with a wild-type configuration at both
polymorphism sites (c.-137 TT and c.-53 CC) carries the c.2097_2098insT insertion. We
could thus confirm that the second most common PEX1 allele, c.2097_2098insT, is coupled to
the c.-137T>C c.-53C>G constellation and probably evolved on the basis of this allele
(Table 2). The evolution of the c.2097_2098insT allele on the basis of the c.-137T>C
c.-53C>G constellation also explains the overrepresentation of the c.-53C>G polymorphism
in our cohort.

Table 3 shows the distribution of these polymorphisms in relation to the most common PEX1
mutation, p.Gly843Asp. The 13 patients with the common sequence at positions c.-137 and
c.-53 and the 11 patients with the two heterozygous 5' polymorphisms were distributed
equally in relation to the p.Gly843Asp mutation. One heterozygous p.Gly843Asp patient was
found to carry the rare homozygous c.-137 CC c.-53 GG constellation. This indicated that
the p.Gly843Asp allele can be found in combination with any of the four 5' polymorphism
constellations and that the p.Gly843Asp mutation and the 5' polymorphisms evolved
independently of each other.

Tables 2 and 3 also indicate the expected effect of the polymorphisms on PEX1 gene
expression levels following the analysis of Maxwell et al. [19]. It was found that the
c.-137T>C polymorphism decreased PEX1 gene expression by 50% (↓), whereas the c.-53C>G
polymorphism increased PEX1 expression by approximately 25% (↑) when analyzed in a
luciferase reporter study [19]. A combination of both 5' polymorphisms did not lead to a
significant change in PEX1 gene expression. To evaluate a possible effect of PEX1 5'
polymorphisms on the severity of the patient phenotype we included the 5' polymorphisms in
a genotype-phenotype analysis.

Genotype-Phenotype-Correlation 

With respect to the common exonic PEX1 mutations, a 
genotype-phenotype correlation has been described 
[9,12,23,24]. Generally, PEX1 alleles that lead to a PTC 
(P), caused by frameshift or nonsense mutations, lead to 
more severe phenotypes than alleles with a missense 
mutation (M). It is thus possible to classify patients in 
three groups, depending on the combinations of PTC and 
M: MM (type 1), PM (type 2), PP (type 3). In order to 
limit the genotypic classes to three, we included the 
patients with short insertions that do not result in PTC in 
the type 1 group and the more extensive deletions 
together with other deletions resulting in PTCs in the 
PTC group. We undertook this analysis with our patient 



Table 2 Distribution of PEX1 5' polymorphisms in relation to the common c.2097_2098insT mutation

| No. of patients (this study) | No. of patients [19] | c.-137T>C polymorphism | c.-53C>G polymorphism | c.2097_2098insT genotype |
|---|---|---|---|---|
| 13 | 8 | TT | CC | - |
| 3 | 0 | TC (↓) | CC | - |
| 5 | 1 | TC (↓) | GC (↑) | - |
| 6 | 15 | TC (↓) | GC (↑) | het |
| 2 | 1 | CC (↓↓) | GG (↑↑) | hom |

Het and hom refer to the heterozygous and homozygous presence of the c.2097_2098insT genotype, respectively. Our study identified a new genetic
constellation, that is, the potentially expression-reducing c.-137T>C transition in the absence of the c.-53C>G transversion (second row). The arrows refer to
the expected effect on PEX1 expression [19].
Table 3 PEX1 5' polymorphisms with respect to the most common mutation c.2528G>A (p.Gly843Asp)

| No. of patients | c.-137T>C polymorphism | c.-53C>G polymorphism | c.2528G>A (p.Gly843Asp) genotype |
|---|---|---|---|
| 2 | TT | CC | het |
| 5 | TT | CC | hom |
| 6 | TT | CC | - |
| 2 | TC (↓) | GC (↑) | het |
| 2 | TC (↓) | GC (↑) | hom |
| 7 | TC (↓) | GC (↑) | - |
| 3 | TC (↓) | CC | het |
| 0 | TC (↓) | CC | hom |
| 0 | TC (↓) | CC | - |
| 1 | CC (↓↓) | GG (↑↑) | het |
| 0 | CC (↓↓) | GG (↑↑) | hom |
| 2 | CC (↓↓) | GG (↑↑) | - |

Het and hom refer to the heterozygous and homozygous presence of the c.2528G>A genotype, respectively. The c.2528G>A allele is found in combination
with all four 5' polymorphism constellations. The arrows refer to the expected effect on PEX1 expression levels.



cohort and plotted the time of survival as a measure of 
phenotypic severity on the y-axis (Figure 2). 
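
The three-class grouping can be sketched as follows. This is an illustration under stated assumptions: each allele is first labelled M (missense-like) or P (premature termination), and the text-based heuristic used here for that labelling (a frameshift "fs" or a trailing "X" in the protein description) is hypothetical and does not cover the larger deletions that the study assigns to the P group by hand.

```python
def allele_class(protein_change):
    """Hypothetical heuristic: 'P' for frameshift or nonsense descriptions, else 'M'."""
    if "fs" in protein_change or protein_change.endswith("X"):
        return "P"
    return "M"


def genotype_type(class1, class2):
    """Map a pair of allele classes ('M' or 'P') to the three genotype types."""
    classes = (class1, class2)
    if any(c not in ("M", "P") for c in classes):
        raise ValueError("allele classes must be 'M' or 'P'")
    return {0: "type 1 (MM)", 1: "type 2 (MP)", 2: "type 3 (PP)"}[classes.count("P")]


# Example genotypes taken from Table 1 (homozygous patients carry the allele twice).
EXAMPLES = {
    "ZS9": ("p.Gly843Asp", "p.Gly843Asp"),
    "ZS27": ("p.Gly843Asp", "p.Arg872X"),
    "ZS29": ("p.Ile700TyrfsX42", "p.Ile700TyrfsX42"),
}

types = {
    patient: genotype_type(allele_class(a1), allele_class(a2))
    for patient, (a1, a2) in EXAMPLES.items()
}
for patient, label in types.items():
    print(patient, label)
```

Counting P alleles keeps the grouping symmetric in the two alleles, so MP and PM genotypes land in the same class.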

This representation confirms that the type 3 genotype 
results in the most severe phenotype with an average sur- 
vival of only 4.7 months (six patients). This group also 



Figure 2 Genotype-phenotype correlation in PEX1 patients with known survival. [Plot: y-axis,
survival (months); x-axis, PEX1 genotype: type 1 (MM), type 2 (MP), type 3 (PP); M = missense
mutation, P = premature termination; symbols distinguish the 5' polymorphism constellations
-137 TT -53 CC, -137 TC -53 GC, -137 CC -53 GG and -137 TC -53 CC.] Patients were grouped
according to their genotypes into patients with missense mutations (M) on both alleles
(type 1, MM), with premature termination codons (PTCs, P) on both alleles (type 3, PP), or
with a combination of a PTC with a missense mutation (type 2, MP). To limit the genotypic
classes to three, we grouped short insertions that do not result in PTC together with M,
and more extensive deletions together with the P alleles. 5' polymorphisms are marked in
color. Numbers in circles refer to ZS patient numbers as in Table 1. Patients with survival
of more than 100 months were alive at the time of submission of the manuscript.




includes patient ZS1 with a homozygous deletion of 115 amino acids. The average survival
in the patient group with a combination of a missense mutation and a PTC (type 2) is 50.8
months (eight patients). This group also includes patient ZS19 with a three-amino-acid
insertion (c.1952_1960dup) in combination with the common p.Ile700TyrfsX42 mutation.
Patients with missense mutations on both PEX1 copies (type 1) show the longest survival,
106 months on average (eight patients). The patient group generally associated with the
longest survival can be divided into two groups: one with survival of less than 40 months,
or 8.2 months on average (four patients), and one with survival of more than 170 months,
or 203 months on average (Figure 2). It has to be noted that patients with a survival of
more than 100 months were alive at the time of the submission of this manuscript, so the
longer survival times can be used prognostically as a lower limit only.
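
The type 1 figures can be cross-checked with a weighted mean. The sketch below assumes that the long-surviving subgroup also comprises four patients (eight type 1 patients minus the four short-surviving ones); that subgroup size is inferred here, not stated in the text.

```python
# (number of patients, mean survival in months) for the two type 1 subgroups.
SHORT_SURVIVAL = (4, 8.2)   # stated in the text
LONG_SURVIVAL = (4, 203.0)  # group size assumed: 8 type 1 patients minus 4

subgroups = [SHORT_SURVIVAL, LONG_SURVIVAL]
n_total = sum(n for n, _ in subgroups)

# Weighted mean: each subgroup mean counts in proportion to its size.
combined_mean = sum(n * mean for n, mean in subgroups) / n_total

print(f"{n_total} patients, combined mean survival {combined_mean:.1f} months")
```

The combined value agrees with the reported type 1 average of about 106 months, and it also shows how little that average describes either subgroup on its own.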

We also indicated the 5' polymorphisms within each of these groups and found that all but
one patient of known survival in the type 1 group have the common 5' genotype c.-137 TT
c.-53 CC. This means that the difference in survival in the type 1 group is not due to
differences in the known 5' polymorphisms and, more generally, that differences in 5'
polymorphisms are not the cause of the different survival of patients with the same
mutations in PEX1. On the other hand, the conclusions that can be drawn from this analysis
are limited by the absence of a pair of patients with functionally different 5'
polymorphisms and with the same exonic genotype. Extreme differences in survival of
patients with identical exonic mutations and 5' polymorphisms suggest, however, that these
differences cannot be attributed to the 5' polymorphisms.

Conclusions 

Our study suggests that only the PEX1 5' polymorphic site at c.-53 is part of the 5' UTR
while the polymorphism 137 bp upstream of the PEX1 start codon might be part of the PEX1
promoter. The common c.2528G>A (p.Gly843Asp) mutation appears to be independent of a
specific upstream polymorphism, in contrast to the second most common mutation,
c.2097_2098insT (p.Ile700TyrfsX42), which arose on the basis of the c.-137T>C c.-53C>G
polymorphic constellation. A novel genotype-phenotype representation with only three
genetic categories allows distinguishing groups of largely differing patient survival, but
5' polymorphisms do not seem to further modulate patient survival. We could not yet,
however, identify patients with identical PEX1 mutations and functionally different 5'
polymorphisms.

Additional material 



Additional file 1: Primers used in this study. Table with list of
oligonucleotide primers used in this study.

Additional file 2: Transcription factor binding sites in the PEX1 5'
upstream region from c.-500 to c.-1. Table with results of an in silico
analysis of transcription factor binding sites in the PEX1 5' region.